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Sequence 1 : 
Sequence 2 : 
Sequence 3 : 
Sequence 4 : 
Sequence 5 : 
Sequence 6 : 
Sequence 7 : 
Sequence 8: 
Sequence 9 : 
Sequence 10 
Aligning 



mSorLA 
mSort 
VpslOp 
mCS2 
hCS3 
mCS3 
mCSla 
mCSlb 
mCSlc 
: hCSl 



2033 aa 
825 aa 
1577 aa 
1159 aa 
1223 aa 
1219 aa 
1147 aa 
1167 aa 
1178 aa 
1168 aa 



Sequences (1:2) Aligned. Score: 20 
Sequences (1:3) Aligned. Score: 8 
Sequences (1 :4) Aligned. Score: 1 8 
Sequences (1:5) Aligned. Score: 16 
Sequences (1:6) Aligned. Score: 15 

Sequences (1:7) Aligned. Score: 13 

Sequences (1:8) Aligned. Score: 14 

Sequences (1:9) Aligned. Score: 13 
Sequences (1:10) Aligned. Score: 14 

(2:3) Aligned. Score: 16 

(2:4) Aligned. Score: 20 

(2:5) Aligned. Score: 20 

Sequences (2:6) Aligned. Score: 20 

Sequences (2:7) Aligned. Score: 21 

Sequences (2:8) Aligned. Score: 21 

Sequences (2:9) Aligned. Score: 20 
Sequences (2:10) Aligned. Score: 21 

Sequences (3:4) Aligned. Score: 11 

Sequences (3:5) Aligned. Score: 11 

Sequences (3:6) Aligned. Score: 11 

Sequences (3:7) Aligned. Score: 10 

Sequences (3:8) Aligned. Score: 11 

Sequences (3:9) Aligned. Score: 10 

CLUSTAL W (1.82) multiple sequence alignment 



Sequences 
Sequences 
Sequences 



Sequences (3:10) Aligned. Score: 10 
Sequences (4:5) Aligned. Score: 42 
Sequences (4:6) Aligned. Score: 42 

Sequences (4:7) Aligned. Score: 42 

Sequences (4:8) Aligned. Score: 42 

Sequences (4:9) Aligned. Score: 43 

Sequences (4:10) Aligned. Score: 42 

Sequences (5:6) Aligned. Score: 92 

Sequences (5:7) Aligned. Score: 63 

Sequences (5:8) Aligned. Score: 62 

Sequences (5:9) Aligned. Score: 64 

Sequences (5:10) Aligned. Score: 63 

Sequences (6:7) Aligned. Score: 63 

Sequences (6:8) Aligned. Score: 62 

Sequences (6:9) Aligned. Score: 63 

Sequences (6:10) Aligned. Score: 61 

Sequences (7:8) Al igned . Score : 9 8 

Sequences (7:9) Aligned. Score: 97 

Sequences (7:10) Aligned. Score: 91 

Sequences (8:9) Aligned. Score: 96 

Sequences (8:10) Aligned. Score: 93 

Sequences (9:10) Aligned. Score: 90 



mSorLA 

mSort 

VpslOp 



YA QYLWITFDFCST IHGFSIPFRAAD LLLHSKASNLLL GFDRSHPNKQLW KSDDFGQTWI 60 
MILLH 5 



mSorLA 

mSort 

VpslOp 



MI QEHVKSFSWGID PYDQPNAIYIER HEPFGFSTVLRSTDFFQSRENQEVILEEVRDFQL 120 
FV YSLWALLLIPLI NAEEFTPKVTKT IAQDSFEILSFDDSNTLIRKQDAS VTISFDDGET 65 



mSorLA 

mSort 

VpslOp 

mCS2 

hCS3 

mCS3 

mCSla 

mCSlb 

mCSlc 

hCSl 



RD KYMFATKWHLP GSQQQSSVQLWV S FGRKPMRAAQFVTKHP INEYY I A DAAEDQVFVC 180 

MERPRG AADG 10 

WE KVEGIEDEITWI YIDPFNRHDRAVATSMYESRLYITNDQGKSWERITLPDSEK- - -NI 122 

MA HRGPPSAPKRPG PTAPDRSFQA 24 

-N MEAARTERPAGR PGAPLVRTGLLLLSTWVLAGAEIT WDATGG PGR PAAPASRP PALS P 59 
- - MEAAGTERPAGWPGAPLARTGLLLLSTWVLAGAEIT WGATGGPGRLVS PASRPPVLPP 58 

MG KVGAGDGYS AG - LSALLAGAGLLML LAPGVCSSLSCCPPQHPSSTPR RT 50 

MG KVG AGDGS SAG - LSALLAGAGLLML LAPGVCSSLSCCPPQHPSSTPR RT 50 

MG KVG AGDGS SAG- LSALLAGAGLLML LAPGVCSSLSCCPPQHPSSTPR RT 50 

MG KVGAGGGS QAR - LSALLAGAGLLIL CAPG VCGGGSCCPS PHPS S APR SA 50 

T50->I 



mSorLA 
mSort 
Vps 1 Op 
mCS2 
hCS3 



VSHSNNSTNLYISE AEGLKFSLSLEN VLYYSPGGAGSD TLVRYFANEPFA DFHRVEGLQG 24 0 

LLRWPLGLLLLLQLLPPAAV G QDRLDAPPPPAPPLLRWAGP 51 

SS RGCY I ETHPLNK NY FLAKCNY CE K TEVDNEENSGDEEGAPVIFNITRCTDKVFASNDG 182 

LLPPCWPRSWPLLLLLLVLVAACGAM GRSPQPGRQGPG VQITRLLPAGRT 74 

LS PRAVASQWPEELASARRAAVLGRR AGPELLPQQGGG RGGEMQVEAGGTSPAGERRGRG 119 
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mCS3 

mCSla 

mCSlb 

mCSlc 

hCSl 



LLPRAAENRWPEELASARRAAAPRRRSRLEPL SQASR GEIR TEAAGMSPEGAR WVPG 115 

LT PRGFPHPGPLGR APATPPPLFMR PLFAVAPG DRAL FLERAGGS R 96 

LT PRGFPHPGPLGR APATPPPLFMR PLFAVAPG DRAL FLERAGGS R 96 

LT PRGFPHPGPLGR APATPPPLFMR PLFAVAPG DRAL FLERAGGS - - - R 96 

ST PRGFSHQGRPGR APATPLPLWR PLFSVAPG DRALSLERARGT G 96 



mSorLA 

mSort 

VpslOp 

mCS2 

hCS3 

roCS3 

mCSla 

mCSlb 

mCSlc 

hCSl 



mSorLA 

mSort 

VpslOp 

mCS2 

hCS3 

mCS3 

mCSla 

mCSlb 

mCSlc 

hCSl 



VY IATLINGSMNEENMRSVITFDKGGTWEFLQAPAFTGYGEKINCELSQGCSLHLAQRLS 300 

VGVSWGLR AAAPGGPVPRAG RWRRGAPAEDQD CGRLP 88 

GK SFSEIKSSLE-RNENSAISISDCG FAKTGKDSDLESSDTSIICLFQNM QLIMDEFSS- 240 

ESGDRKDPQAR ESEPSVPGLGPGSASGPSTDGAPAPGKGRRARAVPVAGAASASR-- 129 

IPAPAKLGGARRSRRAQPPITQERGD AWATAPADGSR- -GSRPLAKGSREEVKAPRAGG- 176 
IP SPSQAGSARRTRRAQPPSPLERGD S W AT ALADG AK - -GSRPHTKGSREEVRATRTGG- 172 
VS VATAARSGRRRR SGTEPEKIEPGE GASRSRRDMLKD GGQQGLGTGARD PGKATRFR- - 154 
VS VATAARSGRRRR SGTEPEKIEPGE GASRSRRDMLKD GGQQGLGTGARD PGKATRFR-- 154 
VS VATAARSGRRRR SGTEPEKIEPGE GASRSRRDMLKD GGQQGLGTGARD PGKATRFR- - 154 
AS MAVAARSGRRRR SGADQE KAE RGE GAS RS PRGVLRD GGQQE PGTRERD PDKATRFR- - 154 
"furin? . "furin? . : 

QLLNLQLRRM P I LS KES APGL 1 1 ATG SVG KNLAS KTNV Y I S S S AGARWRE AL PG PHYYTW 360 

DFIA KLTNNTHQHVFDDLSG S VS LS W 114 

PYTES KLVLTTDWGKSL KEFDQFKDKV 267 

AQV SLISTSFVLKGD ATHNQAMVHW 154 

SAAEDL RLPSTSFALTGD SAHNQAMVHW 204 

ASTEEL RLPSTSFALTGD SAHNQAMVHW 2 00 

MEEL RLTSTTFALTGD SAHNQAMVHW 180 

MEEL RLTSTTFALTGD SAHNQAMVHW 18 0 

MEEL RLTSTTFALTGD SAHNQAMVHW 180 

MEEL RLTSTTFALTGD SAHNQAMVHW 180 
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GD HGGIIMAIAQGM ETNELKYSTNEG ETWKTFVFSEKPVFVYGLLTEPGE KSTVFTIFGS 420 

VG DSTGVILVLTTF QVPLVIVSFGQS KLYRSEDYGKN FKDITNLINNTFI RTEFGM 170 

VNGYRILKSHMWI TQGDRYNDMSSM DVWVSNDLSNFK MAYMPTQLRHSM QGEIYEDAMG 327 

TGENSSVILILTKYYH-ADMGKVLESSLWRSSDFGTT YTKLTLQPG VTTVIDNF- - 207 

SGHNSSVILILTKL YD - FNLGSVTESSLWRSTDYGTT YEKLNDKVG LKTVLSYL-- 257 

SGHNSSVILILTKL YD-FNLGSVTESSLWRSVDYGAT YEKLNDKVG LKTVLSYL- - 253 

SGHNSSVILILTKL YD - YNLGS ITES SLWRSTDYGTT YEKLNDKVG LKTILSYL-- 233 

SGHNSSVILILTKL YD - YNLGS I TES SLWRSTDYGTT YEKLNDKVG LKTILSYL-- 233 

SGHNSSVILILTKL YD- YNLGS ITESSLWRSTDYGTT YEKLNDKVG LKTILSYL- - 233 

SGHNSSVILILTKL YD- YNLGS I TES SLWRSTDYGTT YEKLNDKVG LKTILGYL- - 233 
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NK ESVHSWLILQVN ATDALGVPCTEN DYKLWSPSDERGNECLLGHKTVFKRRTPHATCFN 480 



- - AIGPENSGKVI LTAE VSG 

-RIILPMS RERSDQEED 

--YICPAN KRKIILVSSSL- 

--YVNPTN KRKIMLLSD 

- -YVNPTN KRKIMLLSD- -- 

--YVCPTN KCKIMLLTD 

- -YVCPTN KCKIMLLTD 

--YVCPTN KCKIMLLTD 

- -YVCPTN KRKIMLLTD 



188 
343 
224 
272 
268 
248 
248 
248 
248 



mSorLA 
mSort 
VpslOp 
mCS2 



GEDFDRPVWSNCSCTREDYECDFGF KMSEDLSLEVCVPDPEFFGKPYSPPVPCPVGSSY 540 

GS RGGRVFRSSDFA KNFVQTDLPFHP LTQMMYS PQNSDYLLALST ENGLWVS KNF 24 3 

KG IVSEILISDSQG LKFSPIPWTANE VFG-YINLYQPTYLKGTMIASLYPLSRRRNRKGK 402 
GD REQSLFLSTDEG ATFQKYPVPFL VETLLFH PKEEDKVLAYTK DS KLYVSSDL 278 
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hCS3 PEMES S I L I S SDEGATYQKY RLTFY 1 QSLLFHPKQEDWVLAYSLDQKLYSSMDF 326 

m CS3 PE MESSVLISSDEG ATYQKYRLTFY IQSLLFH PKQEDWVLAYSL DQKLYSSMDF 322 

mCSla PE I ES S LL I S SDEG ATYQKYRLNFY LQSLLFH PKQEDWILAYSQ DQKLYSSAEF 302 

mCSlb PE IESSLLISSDEG ATYQKYRLNFY LQSLLFH PKQEDWILAYSQ DQKLYSSAEF 302 

mCSlc PE IESSLLISSDEG ATYQKYRLNFY LQSLLFH PKQEDWILAYSQ DQKLYSSAEF 3 02 

hCSl PE IESSLLISSDEG ATYQKYRLNFY IQSLLFH PKQEDWILAYSQ DQKLYSSAEF 302 



mSorLA 

mSort 

VpslOp 

mCS2 

hCS3 

mCS3 

mCSla 

mCSlb 

mCSlc 

hCSl 



RR TRGYRKISGDTCSGGDVEARLEGE LVPCPLAEENEFILYAMRKFIYRY DLASGATEQL 600 

G- -EKWEEIHK AVCLAK WGPNNI IFFTTH VNGSCKADLGAL ELWRTSDLGK 292 

AK GVKNKGVTKI SV DNGLTWTVLKW DPDNADS FDCD I TDFENCSLQNMF YTREGSTPTA 462 
G- - KKWTLLQERVT KDHVFWAVSGVDDDPNLVHVEAQD LSGGYRYYTCLI YNCSAQPHIA 336 
G- -RRWQLMHERIT PNRFYWSVAGLD KEADLVHMEVRTTDGYAHYLTCRI QECAETTRSG 384 
G--RRWQLMHERIT PNRFYWSVSGLD KEADLVHMEVRT ADGYAHYLTCR I QECAETTRSG 380 
G- -RRWQLIQESWPNRFYWSVMGSS KEPDLVHLEART VDGHS I YLTCRM QNCTEANRNK 360 
G- -RRWQLIQESWPNRFYWSVMGSS KEPDLVHLEART VDGHS I YLTCRM QNCTEANRNK 360 
G- - RRWQLIQESWPNRFYWSVMGSS KEPDLVHLEART VDGHS I YLTCRM QNCTEANRNK 360 
G--RRWQLIQEGWPNRFYWSVMGSN KEPDLVHLEART VDGHSHYLTCRM QNCTEANRNQ 360 
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PL SGLRAAVALDFG YERNCLYWSDLALDTIQRLCLNGSTGQEVI INSGLE TVEALAFEPL 660 

TFKTIG VKIYSFG LGGRFLFASE 315 

GILMTTG IV GDGSVFDWGDQR TFI SRDGGLTWK LAFDFPCLYAVG DYGNVI VAI P 517 

PFSGP IDRGSLT VQDEYIFLKA 358 

PFARS IDISSLWQDEYIFIQV 406 

PFARS IDISSLWQDEYIFIQV 402 

PFPGY IDPDSLIVQDDYVFVQL 382 

PFPGY IDPDSLIVQDDYVFVQL 3 82 

PFPGY IDPDSLIVQDDYVFVQL 3 82 

PFPGY IDPDSLIVQDHYVFVQL 3 82 



mSorLA 

mSort 

VpslOp 

mCS2 

hCS3 

mCS3 

mCSla 

mCSlb 

mCSlc 

hCSl 



SQ LL YWVDAG F K K I EVANPDGDFRLT IVNSSVLDRPRALVLVPQEGVMFW TDWGDLKPGI 

MA DK- - - DTTRRIH VSTDQGDTWSMA QLPS VGQEQFYS ILAANEDMVFMH VDE 

YN ADEDDDPQSEFYYSLDQGKTWTEYQLETTIYPNEVM NTTPDGSGAKFI LNG F 



TS • 
XT • 
TI - 
TS - 
TS - 
TS - 
TS - 



- TNRTKYYVSYRRSDFVLM KLPKYALPKDLQ IISTDEQQVFVAVQE- 

- SGRASYYVSYRREAFAQI KLPKYSLPKDMH I I STDENQVFAAVQE - 
- GGRAS Y YVS YRREAFAQ I KLPKYSLPKDMH II STDENQVFAAVQE - 

- GGRPHY YVS YRRSPFAQM KLPKYALPKDMH VI STDENQVFAAVQE - 

- GGRPHY YVS YRRS P FAQM KLPKYALPKDMH VI STDENQVFAAVQE - 

- GGRPHYYVSYRRSPFAQM KLPKYALPKDMH VI STDENQVFAAVQE - 

- GGRPHY YVS YRRNAFAQM KLPKYALPKDMH VI STDENQVFAAVQE - 



720 
365 
571 
405 
453 
449 
429 
429 
429 
429 
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YR SYMDGSAAYRLV SEDVKWPNGISVDS QW I YWTDAYLDC IERITFSGQQ 770 

PGDTGFGTIFTSDDRGIVYSKSL DR HLYTTTGG ETDFTNVTSLR 4 09 

TLAHMDGTTNFI YA I DFSTAFNDKTC EENDFEDWNLAE GKCVNGVKYKIRRRKQDAQCLV 631 

WNQVDTYNLYQSDLRGVRYSLVL EN VRSSRQAEENWIDILEVRGVK 452 

WNQNDTYNL Y I SDTRGI YFTLAM EN IKSSRGLMGNI I IELYEVAGIK 500 

WNQNDTYNLYI SDTRGI YFTLAM EN IKSSRGLMGNI I IELYEVAGIK 496 

WNQNDTYNL Y I SDTRGVYFTLAL EN VRSSRGPEGNVMIDLYEVAGIK 476 

---WNQNDTYNLYI SDTRGVYFTLAL EN VRSSRGPEGNVMIDLYEVAGIK 476 

WNQNDTYNL Y I SDTRGVYFTLAL EN VRSSRGPEGNVMIDLYEVAGIK 476 

WNQNDTYNLYI SDTRGVYFTLAL EN VQS S RGPEGN I M I DL YEVAG I K 476 
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mSorLA RSVILDSLPHP YAIAVFKNEIYWDDWSQLSIFRAS KHSRSQVEILASQLTGLMDMKVFY K 830 



mSort GV YITSTLSEDNSI QSMITFDQGGRW EHLRKP-ENSKC DA 448 

VpslOp KK VFEDLQLFETAC DKCTEADYECAF EFVRDATGKCVP DYN L 673 

mCS2 GVFLAN-QKVDGKV TTVITYNKGRDW DYLRPPSTDMNG KP 4 91 

hCS3 GI FLAN - KKVDDQV KT Y I TYNKGRDW RLLQAPDVDLRGSP 539 

mCS3 GI FLAN- KKVDDQV KTY I TYNKGRDW RLLQAPDVDLRGSP 535 

mCSla GMFLAN- KKIDNQV KTF I TYNKGRDW RLLQAPDADLRG DP 515 

mCSlb GMFLAN- KKIDNQV KTF I TYNKGRDW RLLQAPDADLRG DP 515 

mCSlc GMFLAN- KKIDNQV KTF I TYNKGRDW RLLQAPDADLRG DP 515 

hCSl GMFLAN- KKIDNQV KTF I TYNKGRDW RLLQAPDTDLRG DP 515 



mSorLA 

mSort 

VpslOp 

mCS2 

hCS3 

mCS3 

mCSla 

mCSlb 

mCSlc 

hCSl 



GK NAGSNACVPQPCSLLCLPKANNSK 

TAKNKNECSLHIHASYSISQ 

IVLSDVCDKTKKKT VPVKPLQLVKGD 

TNCQPPDCYLHLHLRWADNP 

VHCLLPFCSLHLHLQLSENP 

VHCLLPFCSLHLHLQLSENP 

VHCLLPYCS LHLHLKVSENP 

VHCLLPYCS LHLHLKVSENP 

VHCLLP YCSLHLHL KVS ENP 

VHCLLP YC S LHLHL KVS ENP 



S CRCPEG VAS S VLPSGDLMCDC PQG YQRKNNTCV 890 

KLNVPMAPLSEP NAVGI VIAHG 4 90 

KCKKPMTVKS VD I SCEGVPKKG 721 

YVSGTVH --TKD TAPGL IMGAG 531 

YSSGRIS- -SKE TAPGLWATG 579 

YSSGRIS- -SKDTAPGLWATG 575 

YTSGIIA- -SRD TAPSIIVASG 555 

YTSGI I A- - SRD TAPS I IVASG 555 

YTSGIIA- -SRD TAPSIIVASG 555 

YTSGIIA- -SKD TAPSIIVASG 555 



mSorLA KEENTCLRNQYRCSNGNC INS I WWCD FDNDCGDMSDER NCPTTVCDADTQ FRCQESGTCI 950 

mSort 

Vps lOp 

mCS2 

hCS3 

mCS3 

mCSla 

mCSlb 

mCSlc 

hCSl 



mSorLA 

mSort 

VpslOp 

mCS2 

hCS3 

mCS3 

mCSla 

mCSlb 

mCSlc 

hCSl 



PL SYKCDLEDDCGD NSDESHCEMHQC RSDEFNCS SGMC IRSSWVCDGDND CRDWSDEANC 

SVGD AISVMVPDVYIS D DGGYS WAKMLEGPHY 

TNDK EIWTENKFDFK IQFYQYFDTVTDESLLMINSRGEA YISHDGGQTI 

NLGSQLVE YKEEMY ITS DCGHT WRQVFEEEHH 

NIGP ELSYTDIGVFISS DGGNT WRQIFDEEYN 

NIGS ELSYTDIGVFISS DGGNT WRQIFDEEYN 

NIGS ELSDSDISMFVSS DAGNT WRQIFEEEHS 

NIGS ELSDSDISMFVSS DAGNT WRQIFEEEHS 

NIGS ELSDSDISMFVSS DAGNT WRQIFEEEHS 

NIGS ELSDTDISMFVSS DAGNT WRQIFEEEHS 



1010 

522 

771 

563 

611 

607 

587 

587 

587 

587 



mSorLA 
mSort 
VpslOp 
mCS2 



TAI YHTCEASNFQCHNGHCI PQRWAC DGDADCQDGSDEDPVSCEKKCNGF HCPNGTCIPS 1070 

YT ILDSGGI IVAIE HSNRPINVIKFSTDEGQC WQSYVFTQE 563 

RR FDSNGETI I EWFNP YYNSS AYLF GSKGSIFS THDRGYSFMTA 816 

VL YLDHGGVI AAI K DTS I PLKILKFS VDEGHT WSTHNFTST 604 
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hCS3 

mCS3 

mCSla 

mCSlb 

mCSlc 

hCSl 



mSorLA 

mSort 

VpslOp 

mCS2 

hCS3 

mCS3 

mCSla 

mCSlb 

mCSlc 

hCSl 



VWFLDWGGALVAMKHTPLPVRH LWVSFDEGHS WDKYGFTSV 652 

VWFLDWGGALVAMK HTPLPVRHLWVS FDEGHS WDKYGFTLL 648 

IL YLDQGGVLVAMKHTSLPI RHLWLS FDEGRS WSKYSFTSI 628 

IL YLDQGG VLVAMK HTSLP I RHLWLS FDEGRS WSKYSFTSI 62 8 

IL YLDQGG VLVAMK HTSLP I RHLWLS FDEGRS WSKYSFTSI 628 

VL YLDQGGVLVAMK HTSLP I RHLWLS FDEGRS WSKYSFTSI 628 

SK HCDGLRDCPDGS DEQHCEPFCTRFMDFVCKNRQQCLFHSMVCDGIVQC RDGSDEDAAF 1130 

PI Y FTGLAS E PGA 576 

KLPEARQLGMPLDF N 831 

SVFVDGLLSEPGD 617 

PL FVDGAL VEAGM 665 

PL FVDGAL VEAGV 661 

PLFVDGVLGEPGE 641 

PLFVDGVLGEPGE 641 

PLFVDGVLGEPGE 641 

PLFVDGVLGEPGE 641 



mSorLA 

mSort 

VpslOp 

mCS2 

hCS3 

mCS3 

mCSla 

mCSlb 

mCSlc 

hCSl 



AGCSQDPEFHKECDEFGFQCQNGVCI SLIWKCDGMDDCGDYSDEANCENP TEAPNCSRYF 1190 

RSMNISIW GFTESFITRQWV SYTVDFKDIL 606 

AKAQDTFI YYGGKNCES ILSPECHAVAYLTNDGGE T FTEMLDNA I 876 

ETLVMTVF GHIS- - FRSDWE LVKVDFRPSF 645 

ETHIMTVF GHFS - - LRS EWQ LVKVDYKS I F 693 

ETHIMTVF GHFS - - LRSEWQ LVKVDYKS I F 689 

ETLIMTVF GHFS - -HRSEWQ LVKVDYKS IF 669 

ETLIMTVF GHFS- -HRSEWQ LVKVDYKS IF 669 

ETLIMTVF GHFS- -HRSEWQ LVKVDYKS IF 669 

ETLIMTVF GHFS - - HRSEWQ LVKVDYKS IF 669 



mSorLA 

mSort 

VpslOp 

raCS2 

hCS3 

mCS3 

raCS la 

mCSlb 

mCSlc 

hCSl 



QF HCENGHCI PNRW KCDRENDCGDWS DEKDCGDSHVLP SPTPGPSTCLPNYFRCSSGACV 1250 

ER NCE EDDYTTWLAHSTD PGDYKDGCI 633 

HC EFAGSLFKYPSN EDMVMCQVKEKSSQTRSLVSSTDFFQDDKNTVFENI IGYLSTGGYI 936 

PR QCG EDDYSSWD LTDL QGDHCI 668 

SRHCT KEDYQTWH LLN QGEPCV 715 

SRRCT KEDFETWH LLN QGEPCV 711 

DRRCA EEDYRPWQ LHS QGEACI 691 

DRRCA EEDYRPWQ LHS QGEACI 691 

DRRCA EEDYRPWQ LHS QGEACI 691 

DRRCA EEDYRPWQ LHS QGEACI 691 
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MGTWVCDGYRDCAD GS DEEACPSLANSTAASTPTQLGQCDRFEFECHQPKKC I PNWKRCD 1310 

LG -YKEQFLRLR 644 

IVAVPHE NNELRAYVTID 954 

MG QQRSYRKRK 679 

MG ERKIFKKRK 726 

MG ERKIFKKRK 722 

MG AKRIYKKRK 702 

MG AKRIYKKRK 702 

MG AKRIYKKRK 702 

MG AKRIYKKRK 702 
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GHQDCQDGQDE ANCPTHSTLTCTSREFKCEDGEAC I VLSERCDGFLD CSDESDEKACSDE 1370 

KSSVCQNGRDYWA KQPSVCPCSLED FLCDFGYFRPEN AS ECVE QPELKGHELE 698 

GT EFAEAKFPYDED VGKQEAFTILES EKGSI FLHLATN LVPGRDFGNLLK SNSNGTSFVT 1014 
STSWCVKGRSFTSALTSRVCKCRDSD FLCDYGFERSSSSESTA- -NKCSA NFWFNPLSPP 737 

PG AQCALGRDHSGS WSEPCVCANWD FECDYGYERHGE SQ CVPAFWYNPASPS 779 

PG AQCALGREYSGS WSEPCVCADWD FECDYGYERHGE SQ CVPAFWYNPASPS 775 

SE RKCMQ- - KY AGAME S E PCVCTEAD FDCDYGYERHSNGQ CLPAFWFNPSSLS 753 

SE RKCMQ- - KYAGAMESE PCVCTEAD FDCDYGYERHSNGQ CLPAFWFNPSSLS 753 

- - KYAGAMESE PCVCTEAD FDCDYGYERHSNGQ CLPAFWFNPSSLS 753 



SE RKCMQ - 

SE RKCMQG- KYAGAMESEPCVCTEAD FDCDYGYERHSNGQ - 



-CLPAFWFNPSSLS 754 
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LT VYKVQNLQWTAD FSGDVTLTWMRP KKMPSASCVYNV YYRWGES IWKT LETHSNKTST 

FC LYGKEEHLTTNGYR KIPGDKCQGGM N PAREVK 

LE HAVNRNTFGYVD FEKIQGLEGIIL TNIVSNSDKVAE NKED KQLKTK I T FNEGSDWN- - 



ED CVLGQTYT SSLGYR- 
KD CSLGQSYLNSTGYR- 
KD CSLGQS YLNSTGYR - 
KD CSLGQS YLNSAGYR- 
KD CSLGQS YLNS AG YR- 
KD CSLGQSYLNSAGYR- 
KD CSLGQSYLNSTGYR- 



- KWSNVCEGGV DLQQS PVQLQCPLQAPR- 

- RIVSNNCTDGL REKYTAKAQMCP GKAPR- 
-RIVSNNCTDGL RDKYSAKTQLCP GKAPR- 

- KWSNNCTDGV REQYTAKPQKCP GKAPR- 

- KWSNNCTDGV REQYTAKPQKCP GKAPR- 

- KWSNNCTDGV REQYTAKPQKCP GKAPR- 

- KWSNNCTDGV REQYTAKPQKCP GKAPR- 



1430 

732 

1072 

781 

823 

819 

797 

797 

797 

798 
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VL KVLKPDTTYQVK VQVHCLNKVHNTNDFVTLRTPEGLPDAPRNLQLSLN REEEGVILGH 14 90 

DLKKKCTSNFL NPTKQN 749 

FLKPPKRDS EGKKFFCSSKSL DECSLHLHGYTE RKDIRDTYSS 1115 

GLQVS I RG E AV AVR PREDVL FW RQEQGDVLTT 814 

GLHWTTDGRLVAEQGHNATFI ILMEEGDLQRT 856 

GLHWTTDGRLVAEQGHNATFI ILMEEGDLQRT 852 

GLRIVTADGKL TAEQGHNVTLMV QLEEGDVQRT 830 

GLRIVTADGKL TAEQGHNVTLMV QLEEGDVQRT 830 

GLRIVTADGKL TAEQGHNVTLMV QLEEGDVQRT 830 

GLRIVTADGKL TAEQGHNVTLMV QLEEGDVQRT 831 

WAPPVHTHGLIREY IVEYSRSGSKMWASQRAASNSTEI KNLLLNALYTVR VAAVTSRGIG 1550 

SKSNSVPIILAIVGLMLVTWAG 772 

GS ALGMMFGVGNVG PNLLPYKECSTFFTTDGGETWAEV KKTPHQWEYGDHGGI LVLVPEN 1175 
KYQVDLGDGFKAMY VNLTLTGE P I RHHYES PG I YR VS V RAENMAGHDEAVLFVQVNS PLQ 874 
NI QLDFGDGIAVSY ANFSPIEDGIKH VYKSAGIFQVTA YAENNLGSDTAVLFLHWCPVE 916 
NI QLDFGDGVAVSY ANFSPIEDGIRH VYKSAGI FQVTA YAENNLGSDTAFLFLHWCPVE 912 
LI QVDFGDGI AVS Y VNLSSMEDGI KH VYQNVGI FRVTV QVDNSLGSDS AVLYLHVTCPLE 890 
LI QVDFGDGI AVS YVNLSSMEDGIKH VYQNVGI FRVTV QVDNSLGSDSAVLYLHVTCPLE 890 
LI QVDFGDGI AVS Y VNLSSMEDGI KH VYQNVGI FRVTV QVDNSLGSDSAVLYLHVTCPLE 890 
LI QVDFGDGIAVSYVNLSSMEDGIKHVYQNVGIFRVTV QVDNSLGSDSAVLYLHVTCPLE 891 
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NW SDSKS ITTIKGK VIQAPNIHIDSY DENSLSFTLTMD GDI KVNGYWNLFWSFDAHKQE 1610 

VL I VK KYVCGGR - 784 

SE TDS I S YS TD FG K TW KD Y KFCADKVL VKD I TTVPRDS ALRFLLFGEAAD IGGSSFRTYT 1235 
AL YLEWPVIGVNQ EVNLTAVLLPLN PNLTVFYWWIGHSLQPLLSLDNSV TTKFTDAGDV 934 
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hCS3 HVHLRVPFVAIRNKEWISAVWPSQLGTLTYFWWFGNSTKPLITL DSSISFTFLAEGTD 976 

mCS3 HV HLRVPFVAIRNKDVNISAWWPSQ LGTLTYFWWFGNSTKPLITLDSSI SFTFLAEGTN 972 

mCSla HV HLSLPFVTTKNK EVNATAVLWP SQ VGTLTYVWWYGNNTEPLITLEGSI SFKFTSEGMN 950 

mCS lb HV HLSLPFVTTKNK EVNATAVLWPSQ VGTLTYVWWYGNNTEPLITLEGSI SFKFTSEGMN 950 

mCSlc HV HLSLPFVTTKNK EVNATAVLWPSQ VGTLTYVWWYGNNTEPLITLEGSI SFKFTSEGMN 950 

hcsl HV HLSLPFVTTKNK EVNATAVLWPSQ VGTLTYVWWYGNNTEPLITLEGSI SFRFTSEGMN 951 
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KK TLSFRGGSALSH KVSNLTAHTSYE I S AWAKTDLGDS PLAFEHILTRGSSPPAPSLKAK 

FLVHRYSVLQQHAEADGVEALD STSHAKSGYHDD 

ID FRNI FERQCDFD ITGKESADYKYS PLSSKSNCLFGHQT 

RV TVQAACGNSVLQ DSRLVRVLDQFQ WPLRFS RELDT 

TI TVQVAAGNAL I Q DT KE I AVHE YFQ S QLLS FS PNLDY 

TI TVQVAAGNALIQ DTKEIAVHEYFQSQLLSFSPNLDY 

TI TVQVSAGNAILQ DTKTIAVYEEFR SLRLAFS PNLDD 

TI TVQVSAGNAILQ DTKTIAVYEEFR SLRLAFS PNLDD 

TI TVQVSAGNAILQ DTKTIAVYEEFR SLRLAFS PNLDD 

TI TVQVSAGNAILQ DTKTIAVYEEFR SLRLSFS PNLDD 



1670 
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972 

1014 
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988 

989 
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AI NQT AVE C I WTG P KNWYGI FY ATS FLDL YRNP KS VTT S LHNKTV I VS K DEQYLFLVRV 1730 

SDEDLLE 82 5 

EFLRKTDENC FIGNIPLSEF 1295 

FNPNTPEWR EDVGLWTRL 991 

HNPDIPEWRKDIGNVIKRA 1033 

HNPDIPEWR QDIGNVIKRA 1029 

YNPDI PEWRRDI SRVI KKS 1007 

YNPDI PEWRRDI SRVI KKS 1007 

YNPDI PEWRRDI SRVI KKS 1007 

YNPDI PEWRRDIGRVI KKS 1008 
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LIPYQGPSSDYVW KMIPDSRLPPRH LHAVHIGKTSALIKWESPYDSPDQ DLFYAIAVKD 1790 

SR NIKNCSCTRQDF ECDYNFYKANDGTCKLVKGLSPAN AADVCKKEPDLI EYFESSGYRK 1355 
LS KETS I PEELLVT WKPGLPTIADL YVLLPLPRPTRKRSLTS- -DKRLAAVQQALNSHR 104 9 
LVKVTSVPEDQILIAVFPGLPTSAELFILPPKNLTERRKGNEGDLEQIVE TLFNALNQNL 1093 
L I KVTS VP EDQ I L VAVFPGL PTS AEL F I L P P KNLTERR KGHEGDLEQ I VE TLFNALNQNL 1089 
LVEATGIPSQHILVAVLPGLPTAAELFVLPYQDGTREN KRSPEDLEQISE VLIHKLNQNL 1067 
LV EATGI PSQHI LVAVLPGLPTAAEL FVLPYQDGTREN KRSPEDLEQISE VLIHKLNQNL 1067 
LV EATG I PSQH I LVAVL PGL PTAAEL FVLP YQDGTREN KRSPEDLEQISE VLIHKLNQNL 1067 
LVEATGVPGQHILVAVLPGLPTTAELFVLPYQDPAGEN KRSTDDLEQISE LLIHTLNQNS 1068 
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LI RKTDRSYKVRSRNSTVEYSLSKLE PGGKYHIIVQLGNMSKDSSIKITT VSLSAPDALK 1850 

IPLSTCEGGLKLDAPSSPHACPGK EKE FKEKYS VSAGPFAFI FIS I LLI 14 04 

IS FILRGGLRILVE LRDTDTGP QRPGGSGG-Y WAVWLFVIG 1090 

VQ FELKPGVQVIVY VTQLTLAP LVDSSAGHS SS AMLMLLSW 1135 

VQ FELKPGVQVIVY VTQLTLAP LVDS SAGKSSS AMLMLLSW 1131 

VH FELKPGVQVLVH AAHLTAAP LVDLTPTHSGS AMLMLLSW 1109 

VH FELKPGVQVLVH AAHLTAAP LVDLTPTHSGS AMLMLLSW 1109 

VH FELKPGVQVLVH AAHLTAAP LVDLTPTHSGS AMLMLLSW 1109 

VH FELKPG VRVLVH AAHLTAAP LVDLTPTHSGS AMLMLLSW 1110 
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trans- 
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II TENDHVLLFWKS LALKEKQFNETRGYEIHMSDSAVN LTAYLGNTTDNFFKVSNLKMGH 1910 

I F FAAWFVYDRG I RRNGGFARFGE I R LGDDGLIENNNTDRWNNIVKSGF YVFSNIGSLL 1464 
LFAVGAF I LYKFKRKRPGRTVYAQMHNEKEQEMTS PVSHSEDAQSTMQGNHSGWLS INS 1150 
FVGLAVFLIYKFKRKIPWINIYAQVQHDKEQEMIGSVSQSENAPKITLSD FT-EPEELLD 1194 
FVGLAVFLIYKFKRKIPWINIYAQVQHDKEQEMIGSVSQSENAPKITLSD FT-EPEELLD 1190 

FV GLAVFVI YKFKR CVFLLLP SYPRPPPPSSF CQ-VQKQ- - - 1147 

FV GLAVFVIYKFKRR VALPSPP SPSAQPGDSSLR LQRPRPATPPSS PK-RGSAGAQ 1164 

FV GLAVFVI YKFKRKIPGINVYAQMQNEKEQELINPVSHSESRPSVPHPD LR-RPGQLVD 1168 

FV GLAVFVIYKFKRR VALPSPP SPSTQPGDSSLRLQRARHATPPST PK-RGSAGAQ 1165 

-membrane" .S1149->P 
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NYT FTVQ ARCL FGS Q I CGE P AVLLYDELS SGADAAVI Q AARSTDVAAWVPILFLILLSL 1970 

QHTKTNIAHVISKI RGRFGNRTG PS Y S S L I HDQ FLDEA DDLLAGHDEDAN - - DLS S FMDQ 1522 

RE MHSYLVG 1159 

KE LDTRVIGGIATIANSESTKEIPNCTSV- -'- 1223 

KE LDTRV I GS I AT I AS SESTKE I PNCTSV 1219 

FA I H67 

EKVESQLLGK 111 * 

YAI 1168 

GV GFAILYTKHRRL QSSFSAFANSHYSSRLGSAIFSSG DDLGEDDEDAPM I TG FS DDV PM 2030 
GSNFEIEEDDVPTL EEEHTSYTDQPTTTDVPDALPEGN EENIDRPDSTAP SNENQ 1577 
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